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Abstract 

The M0XD2 gene encodes a membrane-bound monooxygenase similar to dopamine-p-liydroxylase, and lias been 
proposed to be associated with olfaction. In this study, we analyzed M0XD2 genes from 64 mammalian species, and 
identified loss-of-function mutations in apes (humans, Sumatran and Bornean orangutans, and five gibbon species from the 
four major gibbon genera), toothed whales (killer whales, bottlenose dolphins, finless porpoises, baijis, and sperm whales), 
and baleen whales (minke whales and fin whales). We also identified a shared 13-nt deletion in the last exon of Old World 
cercopithecine monkeys that results in conversion of a membrane-bound protein to a soluble form. We hypothesize that 
the frequent inactivation and alteration of M0XD2 genes in catarrhines and whales may be associated with the evolution of 
olfaction in these clades. 
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Introduction 

Olfaction, the sense of smell, is an essential perception that 
animals use to explore their environment. Olfaction plays an 
important role in detecting predators, locating food, and 
identifying mating partners. Specifically, airborne odorant mole- 
cules are recognized and discriminated by olfactory receptors 
(ORs) in the olfactory epithelium [1]. Catarrhines, Old World 
primates of the infraorder Catarrhini, have a smaller olfactory 
bulb and olfactory epithelium compared with other mammals, 
which is likely due to the increased dependence on vision rather 
than olfaction in these species [2,3] . It was previously reported that 
OR genes exhibit relaxed selective constraints in apes (humans, 
chimpanzees, and orangutans), implying that the dependency on 
olfaction is reduced in apes [4] . However, there are no significant 
differences in the number of functional OR genes among New 
World monkeys. Old World monkeys, and apes, indicating that 
there is no direct link between the acquisition of fuU trichromatic 
vision and the degeneration of OR genes [5,6]. 

The olfactory apparatus is completely absent in toothed whales, 
which also have a high number of degenerated OR genes [5,7- 
10]. In contrast to toothed whales, baleen whales still maintain an 
olfactory apparatus, and have a high number of intact OR genes 
[7,8,1 1]. For example, the bowhead whale {Balaena mysticetus), a 
baleen whale species, has a histologically complex olfactory bulb, 
which comprises approximately 0.13% of the total brain weight 
[12]. These whales also possess a high proportion (51%) of intact 
olfactory receptor genes, and it proposed that bowhead whales can 
use olfaction to locate krill, which comprise a major part of the diet 
of certain bowhead whales [12]. Examination of the anatomical 



structure of the olfactory apparatus of minke whales [Balaenoptera 
acutorostrata) also suggested that these whales might maintain the 
olfactory system, and use it to detect airborne odorants from 
clouds of plankton on which they feed [13]. Nevertheless, the 
absence or reduction of an olfactory apparatus in some whales, 
especially in toothed whales, is likely due to their fuUy aquatic 
lifestyle and reliance on sophisticated vocal communication and/ 
or echolocation. 

The monooxygenase, DBH-like 2 {M0XD2) gene was proposed 
to be associated with olfaction because mouse Moxd2 is strongly 
expressed in the medial olfactory epithelium [14,15]. The 
M0XD2 gene comprises 13 coding exons, and encodes a protein 
with an endoplasmic reticulum (ER) signal peptide at its N- 
terminus, and a glycosylphosphatidylinositol (GPI) anchor signal at 
its C-terminus. The MOXD2 protein thus enters the secretory 
pathway and eventually becomes bound to the cell membrane by a 
GPTanchor. The membrane-bound MOXD2 protein contains 
"DOMON" (Pfam accession number PF03351), "Cu2_monooxy- 
gen" (PF01082), and "Cu2_monoox_C" (PF03712) domains. Two 
other genes, the dopamine- fi-hydroxylase [DBH) gene and the 
monooxygenase, DBH-like 1 (MOXDl) gene, encode proteins 
with the same domain organization [16,17]. These three genes 
were generated by duplication during bUaterian evolution [14]. 
Specifically, the DBH gene encodes soluble and membrane-bound 
monooxygenases present in the synaptic vesicles of postganglionic 
sympathetic neurons [18]. Through this, DBH converts dopamine 
to norepinephrine, both of which are neurotransmitters that 
transmit signals between nerve cells. It is probable that the 
function of MOXD2 is similar to that of DBH with respect to 
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neurotransmitter metabolism. Moreover, the mouse Moxd2 gene 
is strongly expressed in the medial olfactory epithelium [15], 
raising the possibility of its involvement in olfactory function. To 
date, however, there is no direct evidence of this functionality. 

We previously reported that the M0XD2 gene is mutated in 
humans {Homo sapiens), Sumatran orangutans (Pongo abelii), 
Bornean orangutans {Pongo pygmaeus), and rhesus macaques 
{Macaco mulatto) [14]. The human M0XD2 is inactivated due to 
the deletion of a genomic region containing exons 12 and 13, 
which occurred after human-chimpanzee divergence. The orang- 
utan M0XD2 has two nonsense mutations and a sphce site 
mutation, rendering it non-functional. Similarly, the M0XD2 of 
rhesus macaques has a frameshift mutation due to a 1 3-nt deletion 
in the last exon, which results in a C-terminal truncation. Finally, 
the western lowland gorilla {Gorilla gorilla gorilla) M0XD2 is 
intact but has an elevated dN/ dS ratio, implying relaxed selection. 

In this study, we examined M0XD2 genes from 64 mammalian 
species and found additional gene inactivation in gibbons and 
whales: the M0XD2 gene is absent in five gibbon species 
representing all four extant genera {Nomascus, Hoolock, Hylobates, 
and Symphalangus) due to a complete gene deletion; the M0XD2 
genes of six whale species, including four toothed and two baleen 
whale species, have many open reading frame (ORF)-disrupting 
mutations; and the sperm whale, a toothed whale species, 
completely lost the M0XD2 gene due to a genomic deletion. 
We hypothesize that the loss of functional M0XD2 genes in apes 
and whales may be associated with the evolution of olfaction in 
these clades. 

Materials and Methods 

Identification of M0XD2 gene sequences 

Mammalian M0XD2 gene sequences were identified by 
BLASTN searches (http://blast.ncbi.nlm.nih.gov/Blast.cgi) of 
the whole genome shotgun (WGS) contigs database in the 
National Center for Biotechnology Information (NCBI) (http:// 
www.ncbi.nlm.mh.gov), and by BLAT searches (http://genome. 
ucsc.edu/cgi-bin/hgBlat) of genome assemblies using the Univer- 
sity of California Santa Cruz (UCSC) Genome Browser web server 
(http://genome.ucsc.edu). The accession numbers of sequence 
data and genome assemblies used in this analysis are listed in 
Table 1. Genomic sequences were downloaded and aligned with 
the chimpanzee {Pan troglodytes) M0XD2 cDNA or that of closely 
related species using SIM4 to predict exon sequences [19]. 
Successfully matched exon sequences were extracted and concat- 
enated to generate virtual cDNA serjuences, which were translated 
to obtain protein sequences. cDNA and protein sequences as well 
as exon coordinates are provided in Data SI. 

To identify the M0XD2 gene cDNA sequences from species for 
which genome sequences wvtv not a\-ailable, we analyzed high- 
throughput serjuencing data a\ ailabk' in the NCBI Sequence Read 
Archive (SRA) (http://www.ncbi.nlm.nih.gov/sra). Genome se- 
quence data for the Bornean orangutan {Pongo pygmaeus), the 
aye-aye {Dauhentonia madagascariensis), the finless porpoise 
{Neophocaena phocaenoides), and the fin whale {Balaenoptera 
physalus), and transcriptome sequence data for the gray mouse 
lemur {Microcehus murinus) were obtained from the SRA. 
Sequences for M0XD2 gene exons were identified using a 
locally-installed BLASTN program (version 2.2.24+). Chimpanzee 
and bottlenose dolphin M0XD2 exons were used as queries for 
primates and whales, respectively. The BLASTN command line 
parameter was "-num_descriptions 10000 -num_alignments 
10000". Matched reads with an expectation (E) value of 1 or 
less were extracted. Paired-end reads were split to separate reads. 



CAPS (version date 12/21/07) was used to align and assemble 
sequences into contigs [20]. For the assembly of Bornean 
orangutan sequences, which were short and showed comparatively 
high heterogeneity, we used parameters "-k 0 -m 5 -n -1 -o 16 -p 
66" to relax the stringency. Otherwise, we used the default 
parameters. CAPS alignments for exons are presented in Data SI. 

To determine whether the M0XD2 gene loci were deleted in 
gibbon species, we examined WGS reads of the eastern hoolock 
gibbon {Hoolock leuconedys), the silvery gibbon {Hylobates moloch), 
the plicated gibbon {Hylobates pileatus), and the siamang 
{Symphalangus syndaclylus). The sequence for the chimpanzee 
MOXD2-PRSS58 genomic locus (panTro4 chr7 : 143686 156- 
14S725295) was downloaded from the UCSC Genome Browser 
database. Gibbon sequences matching the repeat-masked chim- 
panzee MOXD2-PRSS58 genomic locus were identified by 
BLASTN (version 2.2.24+) with the command line parameter "- 
num_descriptions 10000 -num_alignments 10000". Gibbon ge- 
nome reads that aligned the chimpanzee genomic sequence with 
90% sequence identity or greater, and with query coverage of 90% 
or longer, were selected as matches. Paired-end reads were split to 
separate reads. The genomic regions covered by gibbon sequences 
were plotted using gnuplot software (version 4.4 patchlevel S) 
(http://www.gnuplot.info). Additionally, the Bornean orangutan 
WGS reads were analyzed as a control. 

Sequence analyses 

SignalP 4.1 (http://www.cbs.dtu.dk/services/SignalP) was used 
to predict ER signal peptides [21], PrcdGPI (http://gpcr. 
biocomp.unibo.it/ predgpi) was used to predic:t GPI anchor signals 
[22], and the Pfam database (http://pfam.sanger.ac.uk) was used 
to predict domain organization [2S]. RepeatMasker (http: //www. 
repeatmasker.org) was used to search for repetitive elements in 
genomic sequences. Advanced PipMaker and MultiPipMaker 
(http://pipmaker.bx.psu.edu/pipmaker) were used to produce dot 
plots [24] . AU analyses were performed using each respective web 
server with the default settings. Multiple sequence alignments of 
cDNA, exon, or protein sequences were performed using locally- 
installed MUSCLE software (v3.8.31) (http://www.drive5.com/ 
muscle) with the default parameters [25] . 

Statistical analyses 

The ratio of nonsynonymous to synonymous substitution rates 
(dN/ dS, (o) was estimated by a likelihood method implemented in 
the codeml program of the PAML package (version 4.7a) [26]. We 
prepared two datascts: one for the catarrhine primate M0XD2 
gene, and the other for the whak; M0XD2 gene. 

For the catarrhine primate datasct, we collected coding 
sequences oi M0XD2 genes from 13 primate species: humans, 
chimpanzees, lowland gorillas, Sumatran orangutans, rhesus 
macaques, olive baboons, green monkeys, black-capped squirrel 
monkeys, common marmosets, Philippine tarsiers, aye-ayes, gray 
mouse lemurs, and Northern greater galagos. Sequences from the 
bonobo, Bornean orangutan, and crab-eating macaque were not 
included because they were almost identical to chimpanzee, 
Sumatran orangutan, and rhesus macaque sequences, respectively. 
Sequences of exons 12 and IS were removed from the alignment 
because they are absent in humans. The nonsense codon position 
in orangutan exon 8 was also removed. The final sequence dataset 
comprised 1494 nucleotides. Sequence data files, tree files, control 
files, and major results files for the codeml analyses are provided in 
Data S2. 

For the whale dataset, we collected M0XD2 coding sequences 
from six whales (killer whales, bottlenose dolphins, finless 
porpoises, baijis, minke whales, and fin whales), cows, and pigs. 
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Disrupted codon positions in the whale sequences caused by 
nonsense mutation, insertion, and deletion were excluded from the 
aligned sequence data. The final sequence dataset comprised 1791 
nucleotides. Sequence data files, tree files, control files, and major 
results files for the codeml analyses are provided in Data S3. 

Twice the log likelihood difference [2A(ln L)] and the degree of 
freedom (df) were calculated for likelihood ratio tests between 
selected models. The P values for likelihood ratio tests were 
calculated using pchisq function (with the option "lower.tail = 
FALSE") implemented in R software (version 2.14.1) (http:// 
www.r-project.org). 

Results 

Identification of M0XD2 genes in 64 mammalian species 

We previously identified M0XD2 genes in 12 mammalian 
species [14]. In our prior study, we showed that the human, 
Sumatran orangutan, and Bornean orangutan M0XD2 genes are 
non-functional due to multiple deleterious mutations. Similarly, 
the rhesus macaque M0XD2 gene had a 1 3-nt deletion in the last 
exon, resulting in a C-terminal truncation and loss of the GPI 
anchor signal. The M0XD2 gene was previously proposed to be 
associated with mammalian olfactory function, and we concluded 
that these mutations may be implicated in the alteration of 
olfaction. 

These results prompted us to perform the extensive analysis 
described herein on genome sequences of diverse mammalian 
species to investigate the molecular evolution of the M0XD2 gene, 
and its possible connection with olfaction. In this study, we 
analyzed genome assemblies, WGS contigs, and WGS and 
transcriptome reads to identify M0XD2 genes from 64 mamma- 
lian species, including members of the Euarchontoglires, Laur- 
asiatheria, Afrotheria, Xenarthra, and Marsupialia. A complete list 
and phylogenetic tree of taxa analyzed are shown in Table 1 and 
Figure 1, respectively. The geologic timescale of mammalian 
phylogeny is primarily based on the report of dos Reis et al. [27] 
and supplemented by other data [28-31]. 

The complete 1 3 coding exons of M0XD2 genes were identified 
in 53 species out of 64 reported in this study, 49 of which produced 
intact proteins (Data SI). The remaining 15 species, for which the 
M0XD2 gene exhibited deleterious mutations, consisted of 
humans, Sumatran orangutans, Bornean orangutans, northern 
white-cheeked gibbons, eastern hoolock gibbons, silvery gibbons, 
plicated gibbons, siamangs, killer whales, botdenose dolphins, 
fmless porpoises, baijis, sperm whales, minke whales, and fin 
whales (see Table 1 for the detailed list of mutations). As 
previously reported, the human and orangutan M0XD2 genes 
exhibit a deletion of the last 2 exons and multiple inframe 
nonsense codons, respectively [14]. Furthermore, the M0XD2 
gene is absent in the five gibbon species due to a genomic deletion 
(see below), while M0XD2 genes of the six whale species have 
multiple open reading frame (ORF)-disrupting mutations in their 
coding sequences. Lastly, the M0XD2 gene appeared to be absent 
in sperm whales (see below). 

Figure SI shows the multiple sequence alignment of 49 full- 
length MOXD2 proteins togc-ther with the partial human 
sequence, and the Sumatran orangutan sequence with inframe 
stop codons. Both marsupial and placental sequences were 
strongly conserved; specifically, the chimpanzee and the gray 
short-tailed opossum {Monodelphis domestica) MOXD2 proteins 
exhibited an 82.4% amino acid sequence identity. 

The MOXD2 protein was previously reported to contain an ER 
signal peptide and a GPI anchor signal at its N- and C -termini, 
respectively [14], with the exception of the rhesus macaque, in 



which the MOXD2 lacks a C-terminal GPI anchor signal due to a 
frameshift deletion mutation in the last exon. We analyzed the 49 
full-length protein sequences to determine if they had the 
predicted ER signal peptide and GPI anchor signal. All proteins 
were predicted to have an ER signal peptide at their N-termini, 
indicating that they enter the secretory pathway. However, all but 
four of the sequences were predicted to have a GPI anchor signal 
at their C-termini, namely, the rhesus macaque, the crab-eating 
macaque [Macaca fasckularis), the olive baboon [Papio anuhis), 
and the green monkey {Chlorocehus sahaeus). These Old World 
cercopithecine monkeys shared the same 1 3-nt deletion in the last 
exon, resulting in C-terminal truncation and loss of the GPI- 
anchor (see below). 

Complete deletion of the M0XD2 gene in gibbons 

We searched the northern white-cheeked gibbon (NoTnascus 
leucogenys) genome assembly "nomLeu3" available through the 
UCSC Genome Browser for a gibbon M0XD2 genomic sequence. 
There were no matching sequences in the current northern white- 
cheeked gibbon genome assembly. We then searched the NCBI 
sequence databases, including WGS contigs, expressed sequence 
tags, and the Trace Archive, but failed to find any matches. 

The absence of sequence data raised the possibility of complete 
deletion of the M0XD2 gene in the northern white-cheeked 
gibbon, or incomplete sequencing of the M0XD2 genomic locus. 
To further assess these explanations, we searched the northern 
white-cheeked gibbon genome sequence for the PRSS58 gene, 
which is located close to the M0XD2 gene in humans and other 
mammaKan species. We successfully located the northern white- 
cheeked gibbon PRSS58 genomic locus, which was orthologous to 
the MOXD2-PRSS58 genomic loci of humans and other 
mammalian species. We then compared the MOXD2-PRSS38 
genomic locus of the olive baboon and those of the rhesus 
macaque, northern white-cheeked gibbon, Sumatran orangutan, 
western lowland gorilla, chimpanzee, and human (Figure 2A), 
which revealed that the M0XD2 gene was indeed absent from the 
northern white-cheeked gibbon genome. Therefore, this deletion 
was not due to an incomplete northern white-cheeked gibbon 
genome, because there was no sequence assembly gap in its 
genomic region orthologous to the M0XD2 gene locus of other 
primates. Interestingly, the northern white-cheeked gibbon 
M0XD2 gene locus was not simply deleted, but was replaced 
with a DNA fragment rich in repetitive elements. 

Comparison of the MOXD2-PRSS58 genomic loci of the 
northern white-cheeked gibbon and the chimpanzee genome 
assemblies revealed that the 22.5 kb-long genomic fragment 
containing the entire chimpanzee M0XD2 gene was replaced with 
an 11.8 kb-long fragment in the gibbon genome (Figure S2A). 
There were multiple gibbon WGS trace data that spanned each of 
the two replacement boundaries, indicating that our evaluation of 
the northern white-cheeked gibbon genome sequence was not due 
to erroneous assembly (Figure S2B,C). 

There are four extant gibbon genera: Nomascus, Hoolock, 
Hylohates, and Symphalangus [32]. To determine whether the 
M0XD2 gene deletion in the Nosmacus species is shared with the 
other three gibbon genera, we examined WGS reads derived from 
four gibbon species: the eastern hoolock gibbon (a Hoolock 
species), the silvery gibbon and the plicated gibbon {Hylohates 
species), and the siamang (a Symphalangus species). When WGS 
reads were mapped to the chimpanzee MOXD2-PRSS58 
genomic locus, no WGS read mapped to the chimpanzee M0XD2 
gene segment (Figure 2B), indicating that the M0XD2 genomic 
locus was also deleted in these three gibbon genera. Because the 
M0XD2 gene is absent in all the four extant gibbon genera, the 
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Great apes 



Bats 



Marsupialia 



Homo sapiens (1) 
Pan troglodytes (2) 
Pan paniscus (3) 
Gorilla gorilla gorilla (4) 
Pongo abelii (5) 
Pongo pygmaeus (6) 
Nomascus leucogenys (7) 
Hoolock leuconedys (8) 
Hylobates moloch (9) 
Hylobates pileatus (10) 
Symphalangus syndactylus (11) 
Macaca mulatta (12) 
Macaco fascicularis (13) 
Pop/o anubis (14) 
Chlorocebus sabaeus (15) 
Saimiri boliviensis (16) 
Callithrix jacchus (17) 
Tarsius syrichta (18) 
Microcebus murinus (19) 
Dauhentonia madagascariensis (20) 
Otolemur garnettii (21) 
Tupaia chinensis (22) 
Oryctolagus cuniculus (23) 
Mus musculus (24) 
Rattus norvegicus (25) 
Cricetulus griseus (26) 
Dipodomys ordii (27) 
Heterocephalus glaber (28) 
Ictidomys tridecemlineatus (29) 
Pteropus alecto (30) 
Eptesicus fuscus (31) 
Myotis lucifugus (32) 
Myot/s brandtii (33) 
Myot/s doWd/V (34) 
Fefe cotws (35) 
Con/s lupus familiaris (36) 
Ailuropoda melanoleuca (37) 
Leptonychotes weddellii (38) 
Odobenus rosmarus divergens (39) 
Mustela putorius furo (40) 
fquus caballus (41) 
Ceratotherium simum simum (42) 
Camelus ferus (43) 
Sus scro/o (44) 
Capra hircus (45) 
OWs ories (46) 
Pantholops hodgsonii (47) 
Bos tourus (48) 
Orcinus area (49) 
Tursiops truncatus (50) 
Neophocaena phocaenoides (51) 
Lipotes vexillifer (52) 
Physeter macrocephalus (53) 
Balaenoptera acutorostrata (54) 
Balaenoptera physalus (55) 
Erinaceus europaeus (56) 
Loxodonta africana (57) 
Trichechus manatus latirostris (58) 
Elephantulus edwardii (59) 
Chrysochloris asiatica (60) 
Echinops telfairi (61) 
Dasypus novemcinctus (62) 
Monodelphis domestica (63) 
Sarcophilus harrisii (64) 
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Old World monkeys 



New World monkeys 
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Perissodactyls 



Ruminants 
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Figure 1. Phylogenetic tree of the species investigated in this study. The major groups are indicated either on the branch or to the right. 
Species with a mutated M0XD2 gene are indicated by color: red, gene loss by gene deletion, exon deletion, or ORF-disrupting mutations; blue, C- 
terminal truncation due to a 13-nt deletion in the last exon. Numbers in parentheses correspond to those in Table 1. K-Pg, Cretaceous-Paleogene 
boundary. 

doi:10.1371/journal.pone.0104085.g001 



deletion event must have preceded the divergence of these genera, 
which occurred more than 8 million years ago [27,32]. 

Effect of the GPI anchor signal loss on M0XD2 protein 
localization in the Old World cercopithecine monkeys 

We previously showed that the rhesus macaque, an Old World 
cercopithecine monkey species, had a 13-nt deletion in the last 
exon that resulted in C-terminal truncation and loss of the GPI 
anchor signal [14]. In this study, we identified the M0XD2 genes 
from three additional cercopithecine monkey species, namely, the 
crab-eating macaque, the olive baboon, and the green monkey. AU 
of these species exhibited a 13-nt deletion in the last exon, 
indicating that the deletion was present in a common ancestor of 
these four species (Figure 3). However, following the 13-nt 
deletion, the coding sequence integrity was maintained up to 
about 1 3 million years after a common ancestor of these species 
might have lived [27]. Because the deletion occurred in the last 
exon and there are no other deleterious mutations, the 



cercopithecine monkey M0XD2 genes still produce functional 
proteins. 

The 1 3-nt deletion described above results in a loss of the GPI 
anchor signal of the MOXD2 protein, which is required for 
membrane attachment to either the inner aspect of the vesicular 
membrane or outer surface of the plasma membrane [14]. 
Therefore, the cercopithecine monkey MOXD2 proteins do not 
localize on the membrane. Instead, mutation results in a soluble 
protein that is localized within vesicles and/ or secreted from cells. 
The DBH gene, a paralog of M0XD2, encodes both soluble and 
membrane-bound isoforms of dopamine- P-hydroxylase within 
vesicles [17]. If the MOXD2 protein is contained in the vesicles 
as DBH, the soluble cercopithecine monkey MOXD2 may 
maintain the original molecular function and phenotype. 

Inactive M0XD2 genes in toothed whales and baleen 
whales 

We examined genome assemblies and WGS reads of seven 
whale species. Five were toothed whale species: the killer whale 



B 



rhesus macaque northern white-cheeked western lowland chimpanzee human 

(rheMao3 chr3:181494715-181540S86) gibbOH gOrllla (panTro4 ohr7:143686156-143725295) (hg19 ohr7:141922621-14195ai20) 

{nomLeu3 chr13:93611475-93640088) (gorGorS Chr7:140663113-140702312) 



eastern hoolock gibbon 

silvery gibbon 

pileated gibbon 

siamang 



ililll 11 i 



— Bornean orangutan 
chimpanzee 



(panTro4 chr 7: 1 436861 56-1 43725295) 



M0XD2 



PRSS58 



Figure 2. Complete deletion of the MOXD2 gene in gibbons. (A) Dot plots between the olive baboon MOXD2-PRSS58 genomic locus and 
orthologous regions from five other primate species (the rhesus macaque, the northern white-cheeked gibbon, the western lowland gorilla, the 
chimpanzee, and the human) are depicted. The northern white-cheeked gibbon M0XD2 gene is replaced by a translocated DNA fragment (red box). 
The locations of the M0XD2 and PRSS58 genes are indicated at the right (downward arrows), with coding regions and noncoding regions (introns and 
untranslated regions) in green and yellow, respectively. The human genomic segment containing M0XD2 exons 12 and 13 was deleted after the 
human-chimpanzee divergence (red vertical line). Gray vertical lines in the rhesus macaque and the northern white-cheeked gibbon genomes mark 
short assembly gaps. Blue arrows indicate chromosome direction (from p- to q-telomere). (B) Deletion of M0XD2 gene in four other gibbon species. 
WGS reads from the hoolock gibbon, the silvery gibbon, the pileated gibbon, the siamang, and the Bornean orangutan were mapped to the repeat- 
masked chimpanzee MOXD2-PRSS58 genomic sequence. Where matched WGS read positions are marked with dots, the M0XD2 gene region (red box) 
lacked matching gibbon WGS reads but not orangutan WGS reads, indicating that the region is missing in these four gibbon species. The locations of 
the M0XD2 and PRSS58 genes are indicated at the bottom with coding and noncoding regions in green and yellow, respectively. Parentheses indicate 
the genome assembly version and position for each species available from the UCSC Genome Browser database. 
doi:1 0.1 371/journal.pone.01 04085.g002 
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Chimpanzee 
Rhesus macaque 
Crab-eating macaque 
Olive baboon 
Green monkey 
Squirrel monkey 
Common marmoset 



GAGGTGGTGGAAAACACAACAGGCTGGATTCCAGACATCATCCCTACTCTCCGGGGTCCCTGCTTGGAGTCCTCCGGAGGCAAAGTGGAGGCCCAGGACA 100 

GAGGTGGTGGAAAACACAACAGGCTGGATTCCAGACATCATCCCTACTCCCCGGGGGCCCTGCTTGGAGTCCTTGGGAGGCAAAGTGGAGGCCCACGACA 180 

GAGGTGGTGGAAAACACAACAGGCTGGATTCCAGACATCATCCCTACTCCCCGGGGGCCCTGCTTGGAGTCCTTGGGAGGCAAAGTGGAGGCCCACGACA 100 

GAGGTGGTGGAAAACACAACAGGCTGGATTCCAGACATCATCCCTACTCCCCGGGGGCCCTGCTTGGAGTCCTTGGGAGGCAAAGTGGAGGCCCAGGACA 100 

GAGGTGGTGGAAAACACAACAGGCTGGATTCCAGACATCATCCCTACTCCCCGGGGGCCCTGCCTGGAGTCCTTGGGAGGCAAAGTGGAGGCCCAGGACA 100 

GAGGTGGTGGAAAACACAACAGGATGGATTCCAGACATCAGCCCTGCTCCCCAGGGGCCCTGCTTGGAGTCCTCTGGAGGCAAAGTGGAGGCCCAGGACA 100 

GAGGTGGTGGAAAACACAACGGGATGGATTCCAGACATCATCCCTACTCCGCTGGGGCCCTGCTTGGAGTCCTCTGGAGGCAAAGTGGAGGCCCAGGACA 100 



Chimpanzee 
Rhesus macaque 
Crab-eating macaque 
Olive baboon 
Green monkey 
Squirrel monkey 
Common marmoset 



AGACCTCTGCAGGCTTCAG AGCTGCACCTGTA GCCCTCTCAGGCTCTAGCAGGGCCACCCTGAGGCATCTCCCCCTGGCGGCCGTCCTGCTGGTGCAGCG 200 

AGACCCCTGCAGGCTTCAG^^^'^^^^^GCCCTCTCAAGCTCTAGCAGGGCCACCCTGAggcatctctccctggcggccgtcctgctggtgcagcg 187 

AGACCCCTGCAGGCTTCAG^^ GCCCTCTCAAGCTCTAGCAGGGCCACCCTGAggcatctctccctggcggccgtcctgctggtgcagcg 187 

AGACCCCTGCAGGCTTCAG^^^™™|jGCCCTCTCAAGCTCTAGCAGCGCCACCCTGAggcatctccccctggcggccatcctgctggtgcagcg 187 

AGACCCCTGCAGGCTTCAG^^^^^^BGCCCTCTCAAGCTCTAGCAGGGCCACCCTGAggcatctccccctggcggccgtcctgctggtgcagcg 187 

AGACCCCTGCGGGCTTCAGGGCTTCACCCGTGGCGCTCTCGAGCTCCAGCAAGGCCACCCTGAGGCACCTCCTGCTGGCAGCTGTCCTACTGGTGCAGAG 200 

AGACTCCTGCGGGCTTCAGGTCTTCACCTGTGGCCCTCTCGAGCTCCAGCAAGGCCACCCTGAGGCGTCTCCCGCTGGCAGCTGTCCTGCTAGTGCAGAG 200 



Chimpanzee 
Rhesus macaque 
Crab-eating macaque 
Olive baboon 
Green monkey 
Squirrel monkey 
Common marmoset 



CACACTCTCCTGGCTCCTGGCCATGCTGCAGACTGGAGTCTGA 243 

cacactctcctggctcctggccatgccgcagactggagtctga 230 

cacactctcctggctcctggccatgccgcagactggagtctga 230 

cacactctcctggctcctggctacgccgcagactggagtctga 230 

cacactctcctggctcctggctatgccgcagactggagtctga 238 

CACACTGTCCTGGCTCCTGGCCACGCTGCAGACTGGAGTCTGA 243 

CACACTCTCGGGGCTCCTGGCCATGCTGCAGACTAGAGTCTGA 243 



Figure 3. A 1 3-nt deletion in Old World cercopithecine monkey MOXD2exon 1 3. The M0XD2 exon 1 3 sequences from the chimpanzee {Pan 
troglodytes), the rhesus macaque {Macaca mulatto), the crab-eating macaque {Macaca fascicularis), the olive baboon {Papio anubis), the green 
monkey (Chlorocebus sabaeus), the squirrel monkey (Saimiri boliviensis), and the common marmoset (Callithrix jacchus) were subjected to multiple 
alignment. The 13-nt deletion, which was common to the four cercopithecine monkeys, is highlighted in red. Stop codons are highlighted in yellow. 
Note the premature stop codons present in the cercopithecine monkey M0XD2 genes caused by a frameshift deletion. Uppercase and lowercase 
letters indicate coding and noncoding sequences, respectively. 
doi:1 0.1 371/journal.pone.01 04085.g003 



{Orcinus area), the bottlenose dolphin (Tursiops truncatus), the 
fmless porpoise [Neophocaena phocaenoides), the baiji [Lipoles 
vexillifer), and the sperm whale (Physeter macrocephalus). The 
other two were baleen whale species: the minke whale [Balae- 
noptera acutorostrata) and the fin whale [Balaenoptera physalus). 

The complete exon sequences of the M0XD2 gene were 
identified in six of these whale species; the sperm whale M0XD2 
gene was deleted (see below). Upon analyzing the exon sequences 
of M0XD2 genes of the six whale species, we found many ORF- 
disrupting mutations, including frameshift insertions and deletions, 
nonsense mutations, and splice site mutations (see Table 1 for 
details). For example, there was a 4-nt deletion in exon 2 that was 
shared among the four toothed whale species, and an inframe 
nonsense codon ("TAA") that was common to the two baleen 
whale species (Figure 4), indicating that these mutations appeared 
in a common ancestor of each lineage. There were additional 



mutations in many exons, indicating that M0XD2 genes of these 
whale species are non-functional pseudogenes (Figure S3). 

When we searched the sperm whale WGS contigs for the 
M0XD2 genomic sequence, there were no matching sequence 
data. To determine if the M0XD2 gene was deleted in the sperm 
whale genome as in gibbons, the sperm whale WGS contigs were 
searched for the PRSS58 gene, which is located close to the 
M0XD2 gene in other mammalian species. We were able to locate 
the sperm whale PRSS58 genomic locus, which was orthologous 
to the MGAM-MOXD2-PRSS58 genomic loci of the cow {Bos 
taurus) and the bottlenose dolphin. Upon comparing the MGAM- 
MOXD2-PRSS58 genomic loci of the sperm whale and the 
bottlenose dolphin, we found that the M0XD2 gene was absent 
from the sperm whale genome (Figure 5). As in gibbons, the sperm 
whale M0XD2 gene locus was replaced with a repeat-rich DNA 
fragment. 



Pig (aa) 
Cow (aa) 
Pig 
Cow 

Killer whale 
Bottlenose dolphin 
Finless porpoise 
Baiji 

Minke whale 
Fin whale 



DQHVVDEDTLEEDGSQDAELQGLTEDAVYTTMRFS 
DQHLVDENTLEEDGSQDAELQGLTEDAVYTTMRFS 

cccagGATCAGCATGTGGTGGATGAAGACACCCTGGAAGAGGACGGGAGCCAGGACGCGGAGCTGCAGGGGCTGACGGAAGACGCCGTCTACACCACCATGCGCTTCTCC 110 

cccagGACCAGCACCTGGTGGATGAAAACACGCTGGAGGAGGATGGGAGCCAGGATGCAGAACTGCAGGGGCTAACAGAAGATGCCGTCTACACCACCATGCGCTTCTCC 110 

cctagAATCAGCACCTGGTGGATGAAGACGCTCTGGAGGAGGMBGAGCCAGAACGCAGAGCTGCAGGCACTGACAGAAGACACCATCTATACCACCATGCGCTTCTCC 106 

CCtagAATCAGCACCTGGTGGATGAAGACGCTCTGGAGGAGG^HGAGCCAQGACGCAGAGCTGCAGGCACTGACAGAAGACACCATCTATACCACCATGTGCTTCTCC 186 

cctagAATCAGCACCTGGTGGATGAAGACGCTCTGGGGGAGG^HGAGCCAGGACGCAGAGCTGCTGGCACTGACAGAAGACACCATCTATACCACCATGCGCTCCTCC 186 

cctagAATCACCACCTGGTGGATGAAGACGCTCTGGAGGAGGj^BGAGCCAGGATQCGGAGCTGCAGCCACTGACAGAAGACACCATCTATACCACCATGCGCTTCTCC 186 

cccagGATGAGCACCTGGTAGACGAAGACGCTCTGGAGGAGGACGAGAGCCAGGACGCGGAGCTGCTGGCACTGACAGAAGATGCCATCTATACCACGATGCGCTTCTCC 110 

cccagGATGAGCACCTGGTAGACGAAGATGCTCTGGAGGAGGACGGGAGCCAGGACGCGGAGCTGCTGGCACTGACAGAAAACGCCATCTATACCACCATGCACTTCTCC 110 



Pig (aa) 
Cow (aa) 
Pig 
Cow 

Killer whale 
Bottlenose dolphin 
Finless porpoise 
Baiji 

Minke whale 
Fin whale 



RPFRSCDPQDQDIT 
RPFRSCDPHDQDIT 

AGGCCCTTCCGCTCCTGCGACCCCCAGGACCAAGACATCACGgtaaa 157 

AGGCCCTTCCGCTCCTGCGACCCTCACGACCAAGACATCACGgtaaa 157 

AGGCCCTTCTGCTCCTGTGACCCTCAAGACCAAGACATCACGgtaaa 153 

AGGCCCTTCTGCTCCTGTGACCCTCAAGACCAAGACATCACGgtaaa 153 

AGGCCCTTCCGCTCCTGCGACCCTCAAGACCAAGACATCACGgtaaa 153 

AQCCCCTTCCACTCCTGTGACCCTCAAGACCAAGACATCACGgtaaa 153 

AGGCCCTTCCGCTCCTGTGACCCTCATGACTAAGACATCACGgtaaa 157 

AGGCCCTTCCGCTCCTGTGACCCTCATGACJSiCACATCACGgtaaa 157 



Figure 4. Disruptive mutations of the MOXD2 gene in whales. Exon 2 sequences of the pig (Sus scrofa), the cow (Bos Taurus), the killer whale 
(Orcinus orca), the bottlenose dolphin (Tursiops truncatus), the finless porpoise (Neophocaena phocaenoides), the baiji (Lipotes vexillifer), the minke 
whale (Balaenoptera acutorostrata), and the fin whale (Balaenoptera physalus) M0XD2 genes are shown. Disruptive mutations, a 4-nt deletion in 
toothed whales, and an inframe nonsense codon in baleen whales, are highlighted in red. Exon and intron sequences are in uppercase and lowercase 
letters, respectively. Additional mutations in other exons are presented in Figure S3. 
doi:1 0.1 371/journal.pone.01 04085.g004 
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sperm whale 

(AWZP01010387:2819-58326) 

Figure 5. Complete deletion of the MOXD2 gene in the sperm whale. Sequence comparison of tine sperm whale {Physeter macrocephalus) 
(horizontal) and the bottlenose dolphin (Tursiops truncatus) (vertical) M0XD2 gene loci is presented. In the sperm whale, the M0XD2 genomic region 
was replaced with a repeat-rich segment (red box). The bottlenose dolphin MGAM, M0XD2, and PRSS58 genes are marked at the right, with coding 
regions and introns in green and yellow, respectively. Gray horizontal lines indicate gaps in the bottlenose dolphin genome assembly "turTru2". 
doi:1 0.1 371/journal.pone.01 04085.g005 



The large number of deleterious mutations and gene deletion in 
both toothed whales and baleen whales clearly indicated that the 
M0XD2 gene lost its fijnctionality during the evolution of whales. 
The habitat and ecology of whales may have been associated with 
relaxation of selection pressure on M0XD2 gene, which has 
subsequently allowed it to degenerate or be deleted. 

Evolutionary analysis of catarrhlne M0XD2 

Codeml analyses and likelihood ratio tests were performed in 
the same manner as in Jiang et al.'s study [33], which reported 
frequent and independent inactivation of the type 1 taste receptor 
Taslr2/Taslr3 in carnivorans. To evaluate whether selective 
pressure was relaxed on humans and orangutans with an 
inactivated M0XD2, and possibly on gorillas and chimpanzees 
with an intact M0XD2, we performed likelihood ratio tests 
(Tables 2 and 3). First, the average CO across the 1 3 primate species 
was estimated to be 0.18967 (one-ratio model A in Table 2), which 
was significantly different from 1 (one-ratio model B) 
(P = 2.34x10"'*; B vs A in Table 3), indicating an overaU 
purifying selection in species of the data set. 

To test whether the selection pressure was relaxed on humans 
and orangutans, a two-ratio model was tested (model C in 
Table 2), with the assumption of a uniform (0 for human and 
orangutan branches (col), and for other branches (ff)0), respective- 
ly. In this model, coO was estimated to be 0.16649, similar to that 
estimated in model A. However, col was estimated to be 0.70206, 
indicating that selection pressure on human and orangutan 



M0XD2 was relaxed. This two-ratio model C fitted significandy 
better than one-ratio model A (P = 7.57 x 10"''; A vs C in Table 3), 
also implying that the selection pressure was relaxed in the human 
and orangutan branches. Furthermore, when we tested another 
two-ratio model (model D in Table 2) that allowed col (humans 
and orangutans) to be fixed to 1, and assumed a uniform (oO for 
other branches, this model was not significandy different from 
model C (P = 0.2796; D vs C in Table 3), consistent with the 
scenario of relaxed selection pressure on human and orangutan 
M0XD2 (col = 1). An alternative model (model L in Table 2), in 
which (D was allowed to vary among branches, was not 
significandy different from the two-ratio model C (P = 0.2079; C 
vs L in Table 3). Figure 6A shows values of co on each branch, as 
estimated by model L. 

To examine whether selection pressure was also relaxed on 
gorillas, which had an intact M0XD2 coding sequence, a two- 
ratio model (model E in Table 2) was tested, with the assumption 
of a uniform <o for human, gorilla, and orangutan branches (ff)l), 
and for other branches (coO), respectively. In this model, coO was 
estimated to be 0.15984, slightly smaller than that estimated in 
model C; (ol was estimated to be 0.77453, slightly larger than that 
in model C. This two-ratio model E also fitted significandy better 
than the one-ratio model A {P = 1.18x10"'; A vs E in Table 3), 
suggesting that selection pressure on gorilla M0XD2 was also 
relaxed. Another two-ratio model (model F in Table 2) that 
allowed col (humans, gorillas, and orangutans) to be fixed to 1, and 
assumed a uniform coO for other branches, showed no difference 
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Table 2. Models and parameter estimates for likelihood ratio tests of selective pressure on catarrhlne M0XD2 gene^ 





Model 


Description 


0) (dN/dS) 




InL" 


np' 


A 


All branches have the same &0 


(dO 


= 0.18967 




—4803.2004 


25 


B 


All branches have the same 0^0 = 1 


coO 


= 1 




-4969.6092 


24 


C 


The human and orangutan branches have col; other branches have aO 


coO 


= 0.16649, col 


= 0.70206 


-4793.1791 


26 


D 


The human and orangutan branches have col = 1; other branches have coO 


0)0 


= 0.16660, (Bl 


= 1 


-4793.7635 


25 


E 


The human, gorilla, and orangutan branches have col; other branches have coO 


coO 


= 0.15984, col 


= 0.77453 


-4789.1776 


26 


F 


The human, gorilla, and orangutan branches have col = 1; other branches have o^O 


coO 


= 0.15994, col 


= 1 


—4789.5358 


25 


G 


The great ape branches and the ancestral branch leading to great apes have col; 
other branches have aO 


coO 


= 0.16431, (Bl 


= 0.39834 


-4796.8743 


26 


H 


The great ape branches and the ancestral branch leading to great apes have col =1; 
other branches have oiO 


coO 


= 0.16451, col 


= 1 


-4804.5961 


25 


1 


The human, gorilla, orangutan, and cercopithecine monkey branches, and the ancestral 
branch leading to cercopithecine monkeys have the col; other branches have aO 


coO 


= 0.15182, col 


= 0.43710 


-4792.2753 


26 


J 


The human, gorilla, orangutan, and cercopithecine monkey branches, and the ancestral 
branch leading to cercopithecine monkeys have 03l = 1; other branches have 03O 


coO 


= 0.15199, col 


= 1 


-4800.1977 


25 


K 


The human, gorilla, and orangutan branches have the 03I; the cercopithecine monkey branches 
and the ancestral branch leading to cercopithecine monkeys have the o)2; other branches have coO 


coO 
co2 


= 0.15194, (Bl 
= 0.24051 


= 0.77665, 


-4788.1184 


27 


L 


Each branch has Its own o) 


variable co^ 




-4780.1982 


47 



^The sequence data file, tree files, control files, and main result files for the codemi analyses were provided in Data S2. 
"^The natural logarithm of the likelihood value. 
'^Number of parameters. 
^See Figure 6A. 

doi:1 0.1 371 /journal.pone.Ol 04085.t002 



from model E (P = 0.3973; F vs E in Table 3), consistent with the 
scenario of relaxed selection pressure on gorilla M0XD2 (col = 1). 

We further tested whether selection pressure was relaxed before 
the divergence of great apes, including chimpanzees and bonobos, 



which had intact M0XD2. We tested a two-ratio model (model G 
in Table 2) with the assumption of a uniform (B for the great ape 
branches and the ancestral branch leading to great apes ((ol), and 
for other branches (coO), respectively. In this model, coO was 



Table 3. Likelihood ratio tests of selective pressure on catarrhlne M0XD2 gene^. 



Models compared 


2A(ln D" 


df' 


P value 


Significance level'' 


B vs A 


332.818 




2.34x10"'" 


*** 


A vs C 


20.0427 




7.57x10""^ 


*** 


D vs C 


1.1689 




0.2796 


ns 


C vs L 


25.9617 


21 


0.2079 


ns 


A vs E 


28.0456 




1.18x10"' 


www 


F vs E 


0.7165 




0.3973 


ns 


E vs L 


1 7.9588 


21 


0.6516 


ns 


A vs G 


12.6522 




3.75x10"" 


www 


H vs G 


15.4436 




8.50x10"= 


www 


G vs L 


33.3522 


21 


0.04245 


w 


A vs 1 


21.8502 




2.95x10"'* 


www 


J vs 1 


15.8447 




6.88x10"= 


www 


1 vs L 


24.1542 


21 


0.2856 


ns 


J vs L 


39.9989 


22 


0.01081 


w 


A vs K 


30.1641 


2 


2.82x10' 


www 


E vs K 


2.1185 


1 


0.1455 


ns 


Kvs L 


15.8403 


20 


0.7265 


ns 



^Models in Table 3 were compared using likelihood ratio test. 

^Twice the difference in log likelihood values between the two models compared. 

*^Degree of freedom. 

d.««_ p<o.ooi; ••, P<0.01; *, P<0.05; ns, not significant. 
doi:l 0.1 371/journal.pone.0104085.t003 
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Figure 6. Molecular evolution of the MOXD2 gene in catarrhines and whales. IVIolecular evolutionary histories of the M0XD2 gene in 
catarrhines (A) and whales (B) are presented. Gene-disrupting mutations are indicated with arrowheads. dN/dS ratios (to) estimated by the "free-ratio" 
model are presented on each branch. Asterisks indicate species of which coding sequences were used for the dN/dS analysis. e#, exon number; ins, 
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insertion (followed by number of inserted residues); del, deletion (followed by number of deleted residues); ns, nonsense mutation; sa, splice 
acceptor mutation; sd, splice donor mutation. 
doi:1 0.1 371/journal.pone.01 04085.g006 



estimated to be 0.16431; col was estimated to be 0.39834, smaller 
than those estimated by models C and E. This two-ratio model G 
was also significantly diSerent fi-om the one-ratio model A 
(P = 3.75x10"*; A vs G in Table 3), but the difference between 
these models was less than that observed between models F and A. 
Another two-ratio model (model H in Table 2) that allowed col 
(great ape branches and the ancestral branch leading to great apes) 
to be fixed to 1, and a uniform ooO for other branches, was 
significantly different fi-om model G (P = 8.50x10 '; H vs G in 
Table 3), implying that chimpanzee M0XD2 was under selection. 
Therefore, it is highly likely that the M0XD2 gene is stiU 
fijnctioning in chimpanzees and bonobos but not in humans, 
gorillas, orangutans, and gibbons. 

To test whether the selection pressure was also relaxed in Old 
World cercopithecine monkeys as in humans, orangutans, and 
gorillas, a two-ratio model was tested (model I in Table 2), with the 
assumption of a uniform (O for human, gorilla, orangutan, and 
cercopithecine monkey branches, and the ancestral branch leading 
to cercopithecine monkeys (rol), and for other branches (tt)0), 
respectively. In this model, coO was estimated to be 0.15182, 
similar to that estimated in model E. However, rol was estimated 
to be 0.43710, largely smaller than that estimated in model E, 
implying that the selection pressure on cercopithecine M0XD2 
was not relaxed. Furthermore, when we tested another two-ratio 
model (model J in Table 2) that allowed rol to be fixed to 1, and 
assumed a uniform roO for other branches, this model was 
significantly different fi'om model I (P = 6.88x10 ^; J vs I in 
Table 3), fiirther indicating that cercopithecine M0XD2 was 
under selective pressure. Finally, we tested a three-ratio model 
(model K in Table 2), with the assumption of a uniform ro for the 
human, gorilla, and orangutan branches (col), for the cercopithe- 
cine monkeys and the ancestral branch leading to cercopithecine 
monkeys (ffl2), and for other branches (roO), respectively. In this 
model, roO and rol were estimated to be 0.15194 and 0.77665, 
similar to those estimated in model E; co2 was estimated to be 
0.24051, indicating that cercopithecine monkey M0XD2 genes 
are under selective pressure. This model (K) showed no difference 



from model E (P = 0.1455; E vs K in Table 3), consistent with the 
scenario that cercopithecine M0XD2 genes are under selection. 

Therefore, it is likely that the M0XD2 genes are still functioning 
in chimpanzees, bonobos, and Old World cercopithecine mon- 
keys, and that it was independently inactivated in humans, gorillas, 
orangutans, and gibbons. 

Evolutionary analysis of whale M0XD2 

Whale M0XD2 genes have many disruptive mutations, some of 
which are shared by two or more species. Three mutations 
occurred before the divergence of toothed whales examined in this 
study, and one mutation occurred before the divergence of the two 
baleen whales. However, there were no ORF-disrupting mutations 
shared by both the toothed whales and the baleen whales, 
implying that the gene might be intact in the common ancestor of 
the extant whales. 

To verify whether the selective pressure on whale M0XD2 was 
relaxed, we performed likelihood ratio tests (Tables 4 and 5). The 
average CO across the sbc whales, cows, and pigs was estimated to be 
0.31510 (one-ratio model A in Table 4), which was significandy 
different from 1 (one-ratio model B) (P= 1.90x10"'^*; B vs A in 
Table 5), and indicated an overall purifying selection in species of 
this dataset, probably in cows and pigs. 

Because there were no shared mutations between the toothed 
whales and the baleen whales, we tested two hypotheses using 
separate two-ratio models (models C and E in Table 4). In model 
C, a uniform ro for the toothed whale branches, the ancestral 
branch leading to toothed whales, the baleen whale branches, and 
the ancestral branch leading to baleen whales (rol) and for cow 
and pig branches and the ancestral branch leading to whales (roO), 
respectively, was assumed. To examine whether the selection 
pressure was relaxed in the ancestral branch of both whale clades, 
we tested a two-ratio model (model E in Table 4) with the 
assumption of a uniform ro for all the whale branches and the 
ancestral branch leading to whales (rol), and for other branches 
(pigs and cows) (roO), respectively. 



Table 4. Models and parameter estimates for likelihood ratio tests of selective pressure on whale M0XD2 gene^. 





Model 


Description 


CO (dN/dS) 


InL" 


np' 


A 


All branches have the same (oO 


coO = 0.31510 


-5047.2740 


15 


B 


All branches have the same coO = 1 


coO= 1 


-5122.1820 


14 


C 


The toothed whale branches, the branch leading to toothed whales, the baleen whale branches, 
and the ancestral branch leading to baleen whales have col; other branches have coO 


coO = 0.1 5926, col =0.74234 


-5013.9409 


16 


D 


The toothed whale branches, the branch leading to toothed whales, the baleen whale branches, 
and the ancestral branch leading to baleen whales have o>1 = 1; other branches have coO 


coO = 0.1 5938 col = 1 


-5015.8933 


15 


E 


The whale branches and the ancestral branch leading to whales have the col; other branches 
have 0)0 


co0 = 0.12471 col =0.69721 


-5011.6733 


16 


F 


The whale branches and the ancestral branch leading to whales have col = 1; other branches 
have coO 


co0 = 0.12110 col =1 


-5014.6813 


15 


G 


Each branch has Its own co 


variable co^ 


-5006.3099 


27 



^The sequence data file, tree files, control files, and main result files for the codemi analyses were provided in Data S3, 
^he natural logarithm of the likelihood value. 
'^Number of parameters. 
■"See Figure 6B. 

doi:l 0.1 371/journal.pone.Ol 04085.t004 
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Table 5. Likelihood ratio tests of selective pressure on whale M0XD2 gene^. 



Models compared 


2A(ln D" 


dr 


lvalue 


Significance level'' 


B vs A 


149.8160 




1.90x10"" 


»»» 


A vs C 


66.6663 




3.22x10""^ 


*** 


D vs C 


3.9049 




0.04815 


» 


C vs G 


15.2620 


11 


0.1708 


ns 


A vs E 


71.2014 




3.22x10"'' 


»»* 


F vs E 


6.0161 




0.01418 


* 


E vs G 


10.7269 


11 


0.4664 


ns 



^Models in Table 4 were compared using lil<elihood ratio test. 

"^Twice the difference in log lil(elihood values between the two models compared. 

*^Degree of freedom. 

d,»» p<o.ooi; P<0.01; *, P<0.05; ns, not significant. 
doi:l 0.1 371 /journal.pone.Ol 04085.t005 



In model C, coO was estimated to be 0.15926, while col was 
estimated to be 0.74234, indicating that the selection pressure on 
M0XD2 of both whale clades was relaxed. This two-ratio model C 
fitted significantly better than the one-ratio model A 
(P = 3.22x10""*; A vs C in Table 5). When we tested another 
two-ratio model (model D in Table 4) that allowed col (both whale 
clades, the ancestral branch leading to toothed whales, and the 
ancestral branch leading to baleen whales) to be frxed to 1, and 
used a uniform coO for other branches, we observed that this model 
was marginally significandy different from model C (P = 0.04815; 
D vs C in Table 5). An alternative model (model G in Table 4) in 
which CO was allowed to vary among branches was not significandy 
different from the two-ratio model C (P = 0. 1 708; C vs G in 
Table 5). Figure 6B shows co estimated by the model G on each 
branch. 

In model E, coO and ff)l were estimated to be 0.12471 and 

0. 69721, respectively. This two-ratio model E was also significandy 
different from the one-ratio model A (P = 3. 23x10" ;AvsEin 
Table 5), suggesting that selection pressure on M0XD2 in the 
ancestral branch leading to whales was also relaxed. Another two- 
ratio model (model F in Table 4) that allowed col (all the whale 
branches and the ancestral branch leading to whales) to be fixed to 

1, and used a uniform coO for other branches, was marginally 
significandy different from model E (P = 0.01418; F vs E in 
Table 5), implying that the selective pressure on ancestral whale 
M0XD2 was partially relaxed. 

Therefore, it was not clear whether the selection pressure on 
whale M0XD2 had become relaxed in the common whale 
ancestor, or independently in the toothed whale ancestor and the 
baleen whale ancestor. The relaxation of selection pressure may 
have occurred close to when the two whale clades diverged. 

Discussion 

When organisms adapt to a particular environment where some 
of their traits are no longer of use, selection pressure on those 
phenotypes become relaxed, and the associated genes are free to 
accumulate deleterious mutations [34,35]. One example of this 
phenomenon is the diminishment of olfactory function in 
catarrhine primates and toothed whales [3,7,8]. In this study, we 
found that the M0XD2 gene was frequendy inactivated and 
altered in catarrhines and whales, which may be associated with 
the evolution of olfaction in both clades. 

The M0XD2 gene, which is strongly expressed in the olfactory 
epithelium, has been proposed to be associated with olfactory 



function, although no direct evidence supporting this hypothesis 
has been established [14]. The loss of the M0XD2 gene in gibbons 
is concordant with the trend of pseudogenization of this gene in 
humans and orangutans. Human M0XD2 is inactive due to an 
exon deletion, and orangutan M0XD2 has multiple nonsense 
mutations. Likelihood ratio tests have suggested that selection 
pressure on western lowland gorilla M0XD2 might be relaxed. 
However, statistical analyses have implied that chimpanzee, 
bonobo, and Old World cercopithecine monkey M0XD2 genes 
are under selective constraint. Thus, it is highly likely that the 
M0XD2 gene became of no use in some-but not aU-catarrhines, 
and as a result was inactivated during evolution. We hypothesize 
that loss of the M0XD2 gene, together with relaxed selection 
pressure on OR genes [4], may be associated with the altered 
olfaction in some catarrhines. 

Many mammals have secondarily returned to aquatic environ- 
ments, and acquired adaptive molecular changes in many proteins 
during evolution [36]. Whales are adapted to a fully aquatic 
lifestyle, where the detection of airborne odorants is improbable. 
Indeed, the olfactory apparatus is gready reduced in some baleen 
whales, and completely absent in toothed whales; consequently, 
many OR genes have become pseudogenized [7,8]. In this study, 
we found that whale M0XD2 genes are not functional due to 
multiple deleterious mutations and gene deletion. We hypothesize 
that loss of M0XD2 genes in whales may be associated with 
altered olfaction concomitant with the evolution of a fully aquatic 
lifestyle. 

It was previously shown that functional OR gene composition in 
mammals is related to ecological adaptation [1 1]. Thus, functional 
OR gene repertoires are reduced in aquatic mammals. In this 
study, we identified M0XD2 in three aquatic mammalian species 
in addition to whale species, namely, the WeddeU seal [Lepto- 
nychotes weddellii), the Pacific walrus {Odohenus rosmarus diver- 
gens), and the Florida manatee {Trichechm manatus latrirostris). 
The former two are partially aquatic and the latter is fully aquatic. 
Interestingly, the M0XD2 genes of these three species do not have 
any deleterious mutations, indicating that they might be function- 
al. 

Inactivation of the transient receptor potential cation channel, 
subfamily C, member 2 {TRPC2) gene in mammalian species 
exhibits a similar pattern to that of the M0XD2 gene. The 
TRPC2 gene is expressed in the vomeronasal organ, and is a 
crucial component of pheromone transduction [3 7] . TRPC2 is a 
pseudogene in Old World monkeys and apes [38], as well as in 
whales, including both baleen and toothed whales [39]. The 
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TRPC2 gene is inactive in two partially aquatic mammals, the 
harbor seal (Phoca vitulind) and the river otter {Lutra lutra), but is 
intact in the California sea lion [Zalophus calif ornianus), which are 
also partially aquatic, indicating that the gene was randomly lost in 
aquatic mammals. Frequent loss of genes expressed in olfactory 
epithelium (M0XD2) or vomeronasal organs {TRPC2) in catar- 
rhines and whales further demonstrates the diminished olfactory 
function in these clades, as well as relaxed selection pressure on the 
associated genes. 

Conclusions 

In summary, we analyzed the M0XD2 genes from 64 
mammalian species and identified loss-of-function mutations in 
humans, orangutans, gibbons, and whales (Figure 6). We hypoth- 
esize that the inactivation and alteration of the M0XD2 gene may 
be associated with the evolution of olfaction in catarrhine primates 
and whales. Molecular functional studies of the M0XD2 gene in 
model animals with highly sensitive olfaction, such as mice and 
dogs, may provide direct evidence of its role in olfaction. 

Supporting Information 

Figure SI Multiple sequence alignment of MOXD2 
protein sequences. Multiple sequence alignment was prepared 
using MLSCLE. Amino acid sequ(;nces that are the same as 
chimpanzees {Pan Iroglodyles) are marked by dots (.). Stop codons 
and alignment gaps are denoted by red asterisks (*) and hyphens 
(— ), respectively. The human MOXD2 lacks the C-terminal 
region due to deletion of exons 12 and 13. The whale sequences 
were not included due to multiple disruptive mutations. GlustalW 
consensus labels are shown below sequences: asterisks (*), identical; 
colons (:), highly conserved; dots (.), moderately conserved. 
(PDF) 

Figure S2 Sequence comparison of the chimpanzee (Pan 
troglodytes) and the northern white-cheeked gibbon 
{Nomascus leucogenys) MOXD2 gene loci. (A) Dot plot 
comparison of the chimpanzee (horizontal) and the northern 
white-cheeked gibbon (vertical) M0XD2 loci. In the northern 
white-cheeked gibbon, the M0XD2 genomic region was replaced 
with a translocated DNA s(;gmc'nt (red box). The chimpanzee 
M0XD2 and PRSS58 genes are marked at the top with coding 
and non-coding regions (introns and untranslated regions) in green 
and yeUow, respectively. A dotted horizontal line indicates a gap in 
the northern white-cheeked gibbon genome assembly "nomLeuS". 
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